Generalized solution for the Herman Protocol Conjecture

Endre Csoka* Szabolcs Meszaros^ 


Abstract 

We have a cycle of $N$ nodes and there is a token on an odd number of nodes. At each step, each token
independently moves to its clockwise neighbor or stays at its position, each with probability $\frac12$. If two tokens arrive
at the same node, then we remove both of them. The process ends when only one token remains. The question
is: for a fixed $N$, which initial configuration maximizes the expected number of steps $E(T)$? The
Herman Protocol Conjecture says that the 3-token configuration with distances $\lfloor N/3 \rfloor$ and $\lceil N/3 \rceil$ maximizes $E(T)$.
We present a proof of this conjecture not only for $E(T)$ but also for $E(f(T))$ for some functions $f : \mathbb{N} \to \mathbb{R}^+$,
by a method that also applies to different generalizations of the problem.

Keywords: Stochastic processes, Discrete optimization, Random algorithms, Stochastic optimization. 


1 Introduction 


The simplified setup of Herman’s self-stabilizing algorithm consists of a directed circular graph of N elements 
and k tokens put on some of the nodes of the graph. The vertices represent identical processes connected along the 
edges. Ideally, if the system is in a legitimate state, only one process holds a token in the configuration. However, 
errors may occur when the system enters into a multiple token state. Herman’s algorithm is a randomized protocol 
to reach a one-token state after an error, hence the name self-stabilizing. 

The algorithm works as follows: in every step of discrete time, if a process holds
a token then it keeps it with probability $\frac12$ or passes it to its directed neighbor (say, clockwise) with probability
$\frac12$, independently of the other token-passes. If a process kept its token in a step but also receives one, then both
tokens disappear. By the implementation of the processes, we can guarantee that Herman’s algorithm starts at a
configuration with an odd number of tokens, hence the algorithm will eventually yield a one-token
state with probability one.
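The protocol described above can be sketched as a small simulation. This is only an illustrative sketch, not code from the paper: the function names `herman_step`, `stabilization_time` and `estimate_mean_time` are hypothetical, nodes are assumed to be labelled $0, \dots, N-1$ with "clockwise" meaning $+1$ modulo $N$, and a collision is modelled by removing every node that ends a step holding two tokens.

```python
import random
from collections import Counter


def herman_step(tokens, n, rng):
    """One synchronous step: every token stays or moves clockwise, each with prob. 1/2."""
    landed = Counter((p + rng.randint(0, 1)) % n for p in tokens)
    # A node that ends the step with two tokens loses both of them.
    return {p for p, count in landed.items() if count % 2 == 1}


def stabilization_time(tokens, n, rng):
    """Number of steps until exactly one token is left (the hitting time T)."""
    tokens = set(tokens)
    if len(tokens) % 2 == 0:
        raise ValueError("the protocol assumes an odd number of tokens")
    steps = 0
    while len(tokens) > 1:
        tokens = herman_step(tokens, n, rng)
        steps += 1
    return steps


def estimate_mean_time(tokens, n, runs=4000, seed=0):
    """Monte Carlo estimate of E(T) for a given initial configuration."""
    rng = random.Random(seed)
    return sum(stabilization_time(tokens, n, rng) for _ in range(runs)) / runs
```

For instance, `estimate_mean_time({0, 3, 6}, 9)` estimates $E(T)$ for the equidistant three-token state on nine nodes; the estimate is random, so it only approximates the exact expectation.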

Several questions naturally arise about the distribution of the execution time of the self-stabilization, i.e. the
hitting time of a one-token configuration, which we denote by $T$, following the notation of [KMOWW12]. Since
the complete description of the distribution $P(T \le t)$ did not turn out to be an accessible question, the analysis
focused mainly on the derived quantity $E(T)$. Herman proved the bound $\frac12 N^2 \log N$ on $E(T)$ in
the original paper [H90], which was improved to $O(N^2)$ by multiple independent authors [FMP05], [MM05], [N05].

To find a tight bound, it is reasonable to search for the extrema of $E(T)$ as a function of the initial configuration
of the tokens. Assuming that the stabilization starts with 3 tokens, the maximum of $E(T)$ is realized at the
equidistant starting position of the tokens (or the closest configuration to that, if $N$ is not divisible by 3). This is a
consequence of the description of $E(T)$ given by [MM05] for all initial configurations with three tokens. They
found an explicit formula for $E(T)$ in terms of the “distances” of the tokens, where by the distance of the tokens $X_1$ and
$X_2$ we mean the length of the arc connecting $X_1$ and $X_2$ but avoiding the third token $X_3$. Given these distances
$a, b, c \in \mathbb{N}$ of the tokens (where necessarily $a + b + c = N$ by definition), the expectation of $T$ can be expressed as

\[ E(T) = \frac{4abc}{N}, \]

which clearly has its maximum at the points where $a, b, c$ are the nearest integers around $\frac{N}{3}$ summing up to $N$. In
particular, $a = b = c = \frac{N}{3}$ if $N$ is divisible by 3. In [MM05], it was also conjectured to be the only maximum of
$E(T)$ considering all possible initial configurations, not necessarily with three tokens. In this paper we give a
proof of this conjecture.
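For small $N$, the three-token formula $E(T) = 4abc/N$ can be checked exactly by solving the linear system for the expected hitting times. The sketch below is an illustration under stated assumptions, not the method of [MM05]: the helper names `step_outcomes` and `expected_times_three_tokens` are hypothetical, nodes are labelled $0, \dots, N-1$, and exact rational arithmetic is used so that the comparison involves no rounding.

```python
from collections import Counter
from fractions import Fraction
from itertools import combinations, product


def step_outcomes(state, n):
    """Yield the 2^k equally likely successor states of a token set on Z_n."""
    for moves in product((0, 1), repeat=len(state)):
        counts = Counter((p + m) % n for p, m in zip(state, moves))
        # Two tokens meeting on a node annihilate each other.
        yield tuple(sorted(p for p, c in counts.items() if c % 2 == 1))


def expected_times_three_tokens(n):
    """Exact E(T) for every three-token state, from (I - P) E = 1."""
    states = list(combinations(range(n), 3))
    index = {s: i for i, s in enumerate(states)}
    m = len(states)
    # Augmented matrix; one-token states are absorbing with E = 0.
    rows = [[Fraction(0)] * (m + 1) for _ in range(m)]
    for s, i in index.items():
        rows[i][i] += 1
        rows[i][m] = Fraction(1)
        for nxt in step_outcomes(s, n):
            if len(nxt) == 3:
                rows[i][index[nxt]] -= Fraction(1, 8)
    # Gauss-Jordan elimination over the rationals.
    for col in range(m):
        piv = next(r for r in range(col, m) if rows[r][col] != 0)
        rows[col], rows[piv] = rows[piv], rows[col]
        inv = 1 / rows[col][col]
        rows[col] = [v * inv for v in rows[col]]
        for r in range(m):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [x - factor * y for x, y in zip(rows[r], rows[col])]
    return {s: rows[index[s]][m] for s in states}
```

Each state `(p, q, r)` with `p < q < r` has distances `a = q - p`, `b = r - q`, `c = n - (r - p)`, so the returned values can be compared directly with `Fraction(4 * a * b * c, n)`.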

A way for further investigation of the distribution of $T$ would be to show that even $P(T \le t)$ has its minimum
at the equidistant three-token starting state. This is finer information about $T$, since $E(T)$ is the sum of the
$P(T > t)$’s, so a tight bound on $E(T)$ can be understood as the sum of tight bounds on $P(T > t)$. In [KMOWW12],
an explicit formula was established for $P(T \le t)$ assuming that there are three tokens. As a consequence, they
deduced that the minimum of $P(T \le t)$ is indeed realized at the equidistant three-token starting state when we
consider the three-token initial states. The next step could be to obtain this theorem with no restriction on the
number of tokens.


In the paper, we first show a bound on another linear combination of the $P(T > t)$’s, namely that

\[ E\big((1-\varepsilon)^{-T}\big) \le \frac{3}{2} \]

where $\varepsilon = \sin^2\big(\frac{\pi}{2N}\big)$, with equality only if we start from the equidistant three-token configuration. Although this
is not enough to show that the maximum of $E(T)$ is at the mentioned equidistant state, it is further evidence
towards the hope that $P(T \le t)$ is minimized by the equidistant three-token configuration. Indeed, $E\big((1-\varepsilon)^{-T}\big)$
is also a linear combination of the $P(T > t)$’s but with weights $(1-\varepsilon)^{-t} - (1-\varepsilon)^{-(t-1)} = \varepsilon(1-\varepsilon)^{-t}$ instead of 1’s, as
in the case of $E(T)$, hence the inequality is now a weighted sum of tight inequalities on $P(T > t)$.

To establish this result, we first deduce a recursion on the expected evolution of a potential of
the process. The recursion can also be used to prove similar tight inequalities on several other linear combinations
of the $P(T > t)$’s if the weights are sufficiently well-behaved. All such estimates are tight only in the case of the
equidistant initial configuration, giving further evidence for the general conjecture about $P(T \le t)$. As an
example of this method, we prove the case of constant weight 1, namely, we prove that $E(T) \le \frac{4N^2}{27}$ and that
it is maximized only by the equidistant initial configuration. The argument is not combinatorial in its
intrinsic nature, hence it can be generalized to the case when the tokens take steps by Poisson clocks or when the
circular graph is replaced by a continuous circle on which the tokens follow the distribution of a Brownian motion.
Also, the proof of Theorem 2.5 shows that if there are more than three tokens then the expected growth of the
potential is significantly bigger compared to the decrease of $E(T)$, and this extra freedom can be used to generalize
the statement to $E(f(T))$ for functions that are “close enough” to $E(T)$ or $E(a^T)$.

The organization of the paper is the following: we first develop the main ideas of the paper, then we give the
main steps of the proofs of the two mentioned theorems, postponing some computations. These technical details and
the equation on the evolution of the potential are discussed in the third section.

The authors are thankful to Andrzej Murawski, who drew attention to the Herman Protocol Conjecture at
the first problem solving session of the DIMAP Retreat in March 2013. The key idea of this solution, Theorem
2.5, and the sketch of the modification leading to Theorem 2.1 were explained in the second problem solving session.
Another solution of the conjecture using independent techniques is shown in [BGKOW15].


2 Main results 

First, we introduce the notation, using a slightly modified viewpoint on the described process. Since the
arguments require some symmetry, for our purposes it is better to rotate the base space counter-clockwise by $\frac{\pi}{N}$
(half the angle between neighboring nodes) after every step, where $N$ stands for the number of nodes. This slight notational modification has the effect
that the number of nodes gets doubled, but half of them are necessarily avoided by the tokens in every step. In the
following we will refer to this $2N$ as the number of nodes. Moreover, the tokens now move in a symmetrized way:
they either go to the clockwise neighboring new node with probability $\frac12$ or move in the opposite direction to the
counter-clockwise neighbor with probability $\frac12$, all independently.

The (possibly new) nodes will be numbered by $1, 2, \dots, 2N$. The location of the $j$-th token at time $t \in \mathbb{N}_{\ge 0}$ is
described by the random variable $X_t(j)$ where $j = 1, 2, \dots, K_t$ and $K_t$ stands for the number of tokens at time $t$,
where the tokens are numbered compatibly with their ordering on the circle (but the beginning of the enumeration is
arbitrary). Generalizing the notation, we will write $K_t(x)$ for the (random) number of tokens at time $t$ for the
process starting at the initial state $x$. In particular, $K(x) := K_0(x)$ denotes the number of tokens at state $x$. As in
the introduction, $T := T(x) := \min\{t \mid K_t(x) = 1\}$ is the hitting time of a one-token state, i.e. the execution time
of the self-stabilizing algorithm. Note that this notion is not affected by the symmetrization of the process. Also,
we need a notation for the hitting time of a three-token state, which will turn out to be a major turning point in the
evolution of the process, so $\tau := \min\{t \mid K_t(x) = 3\}$. We will refer to the equidistant three-token configuration in
this way without elaborating on the cases where $2N$ is not divisible by 3.

The final goal of the paper is to prove 

Theorem 2.1. $E(T) \le \frac{4N^2}{27}$, with equality if and only if we start from the equidistant three-token state.

To verify this bound, it seems natural not only to keep track of when the process terminates but also to have a way
to measure “how far” we are from the end in expectation. Then the goal becomes to show that this measure is
the worst (i.e. the highest) through the whole process if and only if the initial state is the equidistant three-token
state. So we define a “potential” that grows from zero to one as the process gets closer to its final state. We will
denote the first such potential by $\Phi$, which is defined in the obvious way: assign to a state $x$ the expected value of
the remaining time until it terminates, and rescale it into $[0,1]$:

\[ \Phi(x) := 1 - \frac{E(T(x))}{\max_{y,\,K(y)\le 3} E_y(T(y))} \]

for all states $x$. By [MM05], if $x$ is a configuration with three tokens, then $\Phi$ gets the form

\[ \Phi(x) = 1 - E(T(x)) \cdot \frac{27}{4N^2} = 1 - \frac{27abc}{N^3} \]

where $a$, $b$ and $c$ are the “distances” between the three tokens at state $x$, i.e. the number of nodes on the arcs
connecting two tokens while avoiding the third. (Note that this formula is obtained in the non-symmetrized setup,
hence it counts only the original nodes; counted in the new nodes, each distance is twice as large.) The main idea of the proof is to
introduce a new “potential” $x \mapsto \Psi(x) \in [0,1]$, that is, something that measures how far our state is from the final state
in expectation, whose growth speed can be estimated without trying to compute the original potential $\Phi$ for all
configurations with an arbitrary number of tokens, which does not seem feasible.

In the definition of $\Psi$ we will use the complex exponential function $k \mapsto e^{i\pi k/N}$. Its only purpose is to
avoid a new notation for the function which assigns the corresponding unit vector to the nodes when the ring
is embedded into the complex plane. However, this notation also implicitly contains an identification of the plane
with the complex plane (an identification that in principle was chosen when we numbered the nodes). In other words,
we secretly chose a fixed direction, which we should not totally forget about.

Definition 2.2. Let $x$ be an arbitrary state and assume that $1 \le x(1) \le x(2) \le \dots \le x(K(x)) \le 2N$, where $x(j)$ is
the position of the $j$-th token of the state $x$ using counter-clockwise enumeration of the tokens starting at direction
$1 \in \mathbb{C}$. Then the potential $\Psi$ is defined as

\[ \Psi(x) := \Big| \sum_{j=1}^{K(x)} (-1)^j e^{\frac{i\pi}{2N} x(j)} \Big|^2. \]

If the node $2N$ is not between $x(K(x))$ and $x(1)$ then we first re-enumerate the tokens to fit the assumption. We will
see in Proposition A.1 that neither the choice of the direction nor the re-enumeration of the vectors affects $\Psi(x)$,
even though the summands separately depend on those.


Geometrically, $x \mapsto \Psi(x)$ can be described as summing up the (directed) angle bisectors of the vectors $e^{\frac{i\pi}{N} x(j)}$
(maybe after renumbering) and the fixed unit vector 1, with an extra twist: for odd $j$ we reflect the resulting angle
bisector vector through the origin. Informally, this reflection is applied to stabilize the quantity under the disappearance
of two colliding tokens. Formally, it means that if $x(j) = x(j+1)$ then deleting these two tokens from the vector $x$
will not change the value of $\Psi(x)$. The alternating sign is also responsible for the independence of $\Psi$ from the choice
of the direction. These issues are discussed in detail in the Appendix, but we state here the properties of $\Psi$ that we
plan to use in this section:



Proposition 2.3. For any state $x$ we have

1. The choice that we made at the identification of the plane with $\mathbb{C}$ does not affect $\Psi(x)$. In other words, $\Psi(x)$
is invariant under the simultaneous translation of the $x(j)$’s, even if during the translation a token jumps over
$1 \in \mathbb{C}$.

2. The disappearance of two tokens when they meet does not affect $\Psi(x)$.

3. $\Psi(x) \le 1$, with equality if and only if there is only one token at state $x$.

4. $\Psi(x) \ge 0$, with equality if and only if $x$ is the equidistant (not necessarily three-token) configuration.
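The listed properties can be probed numerically on small examples. The sketch below assumes that the potential has the form $\Psi(x) = \big|\sum_j (-1)^j e^{i\pi x(j)/(2N)}\big|^2$ on the $2N$ symmetrized nodes; the function name `psi` is hypothetical, and the checks are spot checks for illustration, not a proof.

```python
import cmath
import math


def psi(positions, n):
    """Squared length of the alternating sum of angle-bisector unit vectors.

    `positions` are token positions among the 2n symmetrized nodes (1..2n);
    they are sorted so that tokens are enumerated counter-clockwise.
    """
    xs = sorted(positions)
    total = sum(
        (-1) ** j * cmath.exp(1j * math.pi * x / (2 * n))
        for j, x in enumerate(xs, start=1)
    )
    return abs(total) ** 2
```

With `n = 9`, one would expect `psi([5], 9)` to be 1 (a single token), `psi([6, 12, 18], 9)` to vanish (equidistant tokens), and `psi([2, 7, 7, 11, 16], 9)` to agree with `psi([2, 11, 16], 9)` (a colliding pair drops out), all up to floating-point error.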

Now, we fix the initial state $x$ of the process $t \mapsto X_t$. Let us denote by $Y_t = \Psi(X_t)$ the value of the potential
defined above on the random process at time $t$. The evolution of $Y_t$ is described by the following lemma:

Lemma 2.4. For all $t \in \mathbb{N}$ the following holds:

\[ E(Y_{t+1} \mid \mathcal{F}_t) = (1-\varepsilon) Y_t + \varepsilon \cdot K_t \]

where $\mathcal{F}_t := \sigma(X_s(j) \mid s \le t,\ j \le K_0)$ is the standard filtration of the process, $\varepsilon := \sin^2\big(\frac{\pi}{2N}\big)$, and $K_t = K_t(x)$ is
the number of tokens at time $t$.
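The one-step recursion can be verified by brute force for a fixed state, averaging the potential over all $2^{K}$ equally likely moves. The sketch assumes the potential $\Psi(x) = \big|\sum_j (-1)^j e^{i\pi x(j)/(2N)}\big|^2$ and $\varepsilon = \sin^2(\pi/(2N))$; the names `psi_ordered` and `expected_psi_after_step` are hypothetical. Positions are kept unwrapped (a token may step to $0$ or $2N+1$) and in their original order, so no re-enumeration is needed.

```python
import cmath
import math
from itertools import product


def psi_ordered(xs, n):
    """Potential of tokens listed in circular order (positions may be unwrapped)."""
    total = sum(
        (-1) ** j * cmath.exp(1j * math.pi * x / (2 * n))
        for j, x in enumerate(xs, start=1)
    )
    return abs(total) ** 2


def expected_psi_after_step(xs, n):
    """Average of the potential over all 2^K equally likely +-1 moves."""
    values = [
        psi_ordered([x + d for x, d in zip(xs, moves)], n)
        for moves in product((-1, 1), repeat=len(xs))
    ]
    return sum(values) / len(values)
```

For a state `xs` on `2 * n` nodes, `expected_psi_after_step(xs, n)` should then agree with `(1 - eps) * psi_ordered(xs, n) + eps * len(xs)` up to rounding, where `eps = math.sin(math.pi / (2 * n)) ** 2`.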


This Lemma is proved in the Appendix, although it is a crucial step toward the theorem. To apply this formula, 
the first idea is to iterate this recursion under expectation. The following theorem shows the result of this approach: 

Theorem 2.5. For $\varepsilon = \sin^2\big(\frac{\pi}{2N}\big)$ the following holds:

\[ E\big((1-\varepsilon)^{-T}\big) \le \frac{3}{2} \]

with equality if and only if we start from the equidistant three-token configuration.
Note that this statement is a tight inequality with the sum of the $P(T > t)$’s with the weights $(1-\varepsilon)^{-t} - (1-\varepsilon)^{-(t-1)} = \varepsilon(1-\varepsilon)^{-t}$
on the left hand side.
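To make the remark on the weights explicit, here is a short worked derivation (a sketch of a standard telescoping argument, added for illustration; it is not taken from the original text). Writing the exponential as a telescoping sum over the steps before termination gives:

```latex
\begin{align*}
(1-\varepsilon)^{-T}
  &= 1 + \sum_{t=1}^{T} \Big( (1-\varepsilon)^{-t} - (1-\varepsilon)^{-(t-1)} \Big)
   = 1 + \sum_{t=1}^{\infty} \varepsilon (1-\varepsilon)^{-t} \, \mathbf{1}_{(T \ge t)}, \\
E\big((1-\varepsilon)^{-T}\big)
  &= 1 + \sum_{t=1}^{\infty} \varepsilon (1-\varepsilon)^{-t} \, P(T > t-1).
\end{align*}
```

By comparison, $E(T) = \sum_{t \ge 1} P(T > t-1)$ uses the constant weight 1 for every term.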

Proof. Fix a time $t \in \mathbb{N}$ and expand $E(Y_t) = E(E(\dots E(E(Y_t \mid \mathcal{F}_{t-1}) \mid \mathcal{F}_{t-2}) \dots \mid \mathcal{F}_0))$ using Lemma 2.4:

\[ E(Y_t) = (1-\varepsilon)^t E(Y_0) + \varepsilon \sum_{s=0}^{t-1} (1-\varepsilon)^{t-1-s} E(K_s). \]

Here, we may collect the times $s$ with the same $K_s$. Note that in the following sum the index “runs backwards in
time”, which may disturb the reader’s visualization of the computations.

\[ E(Y_t) = (1-\varepsilon)^t E(Y_0) + E\Big( \sum_{h=0}^{K_0} \sum_{s:\,K_s = h} h \cdot \varepsilon (1-\varepsilon)^{t-1-s} \Big) \]

where $s$ runs only up to $t-1$. To write the second sum in a compact form, we introduce the (maximized) hitting
times of the $h$-token states as follows:

\[ L_h := \min\big\{ t,\ \min\{ s \mid \text{the number of tokens at the $s$-th moment is } h \} \big\} \]

for all odd $h$. For simpler notation, we will write $L_h = 0$ for even $h$ and $L_{-1} := t$. Clearly, $L_1 = \min(T, t)$ and
$L_3 = \min(\tau, t)$. This way, the second expectation gets the form

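\[ E\Big( \sum_{h=1}^{K_0} \sum_{s=L_h}^{L_{h-2}-1} h \cdot \varepsilon (1-\varepsilon)^{t-1-s} \Big) \]

where the latter sum is in fact telescopic since $\varepsilon(1-\varepsilon)^{t-1-s} = (1-\varepsilon)^{t-1-s} - (1-\varepsilon)^{t-s}$, yielding

\[ = E\Big( \sum_{h=1}^{K_0} h \cdot \big( (1-\varepsilon)^{t-L_{h-2}} - (1-\varepsilon)^{t-L_h} \big) \Big). \]

Now, collecting the terms by the exponents of $1-\varepsilon$ gives (with $h$ running over odd values)

\[ E(Y_t) = (1-\varepsilon)^t E(Y_0) + E\Big( (1-\varepsilon)^{t-L_{-1}} - K_0 (1-\varepsilon)^{t-L_{K_0}} + \sum_{h=1}^{K_0-2} (h+2-h) \cdot (1-\varepsilon)^{t-L_h} \Big) \]

\[ = (1-\varepsilon)^t E(Y_0) + 1 - K_0 (1-\varepsilon)^t + 2 \sum_{h=1}^{K_0-2} E\big( (1-\varepsilon)^{t-L_h} \big). \]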

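So we got a formula for $E(Y_t)$, although it is not closed. However, we can rearrange the expression to estimate
$E\big((1-\varepsilon)^{-L_1}\big) = E\big((1-\varepsilon)^{-\min(T,t)}\big)$ as follows (again with $h$ running over odd values):

\[ E\big((1-\varepsilon)^{-L_1}\big) = \frac{E(Y_t) - 1}{2(1-\varepsilon)^t} + \frac{K_0 - E(Y_0)}{2} - \sum_{h=3}^{K_0-2} E\big((1-\varepsilon)^{-L_h}\big) \le 0 + \frac{1}{2}\big(K_0 - E(Y_0)\big) - \frac{K_0-3}{2} = \frac{3 - E(Y_0)}{2}. \]

Here we used $Y_t \le 1$ and $(1-\varepsilon)^{-L_h} \ge 1$ for each of the $\frac{K_0-3}{2}$ odd values of $h$ in the sum. Since $E(Y_0) \ge 0$ by
Proposition 2.3, the right hand side is at most $\frac{3}{2}$.

The right hand side of the resulting inequality does not depend on $t$, hence, if we take the (non-decreasing) limit
in $t$ under the expectation on the left hand side, then the inequality remains valid. This proves the first part of the
statement.

For the equality case, note that we used estimates only in the last step, so equality holds exactly
if we did not lose anything in this estimate. Namely, it is equivalent to $E(Y_0) = 0$ and $E\big((1-\varepsilon)^{-L_h}\big) = 1$ for
all $3 \le h \le K_0 - 2$. The latter equality means $K_0 = 3$, so we are in a three-token configuration, and the first
one, by Proposition 2.3, is equivalent to the assumption that the tokens are distributed equidistantly, proving the
statement. □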
As we mentioned in the introduction, $E\big((1-\varepsilon)^{-T}\big)$ is not in an obvious relation with $E(T)$, so we need to
essentially modify the argument above to deduce Theorem 2.1. The nontrivial problem in the background is that
the potential $\Psi$ does not grow fast enough in conditional expectation if there are three tokens, but it does if there are
more. So the idea of the correction is to change the potential at the (random) moment when we enter a three-token
state. In fact, we will return to the original potential $\Phi$, which already worked well in the three-token states, i.e. it
is computable there. Going back to Lemma 2.4, we would like to conclude a relation for the expected change of
the potential after one step, if we still have at least 5 tokens. In formulas, we need the following:

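Corollary 2.6. $E(Y_\tau) \ge 4\varepsilon E(\tau) + Y_0$, with equality if and only if $K_0 < 5$.

Proof. First observe that Lemma 2.4 implies

\[ E(Y_{t+1} \mid \mathcal{F}_t) \cdot 1_{(t<\tau)} = \big((1-\varepsilon) Y_t + \varepsilon \cdot K_t\big) \cdot 1_{(t<\tau)} \ge (Y_t - \varepsilon \cdot 1 + \varepsilon \cdot 5) \cdot 1_{(t<\tau)} = (Y_t + 4\varepsilon) \cdot 1_{(t<\tau)} \]

since $K_t \ge 5$ on the event $(t < \tau)$ and $0 \le Y_t \le 1$ by Proposition 2.3. So we can apply the usual argument to get a
bound on $Y_\tau$:

\[ Y_\tau - Y_0 = \sum_{t=0}^{\infty} (Y_{t+1} - Y_t) \cdot 1_{(t<\tau)}. \]

We can take the expectation of it, where interchanging $E(\cdot)$ with the sum is possible since

\[ \sum_{t=0}^{n} (Y_{t+1} - Y_t) \cdot 1_{(t<\tau)} = Y_{(n+1)\wedge\tau} - Y_0 \in [-1, 1], \]

so we can apply Lebesgue’s theorem. Hence,

\[ E(Y_\tau - Y_0) = \sum_{t=0}^{\infty} E\big( (Y_{t+1} - Y_t) \cdot 1_{(t<\tau)} \big) = \]

where we apply the Law of Total Expectation:

\[ = \sum_{t=0}^{\infty} E\big( E\big( (Y_{t+1} - Y_t) \cdot 1_{(t<\tau)} \mid \mathcal{F}_t \big) \big) = \sum_{t=0}^{\infty} E\big( E( Y_{t+1} - Y_t \mid \mathcal{F}_t ) \cdot 1_{(t<\tau)} \big) \ge \]

By the estimate described at the beginning of the proof, we get

\[ \ge \sum_{t=0}^{\infty} E\big( 4\varepsilon \cdot 1_{(t<\tau)} \big) = 4\varepsilon \cdot E(\tau) \]

as we stated. □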


This lower bound will be enough to prove that no initial configuration can yield slower growth (in expected
value) of $Y$ than the equidistant three-token configuration until we reach a three-token state. The problem is still
that such an estimate does not work on $(t \ge \tau)$ (there we would get 2 instead of 4), when we have only three
tokens and $Y$ may slow down.

So to solve this, as we mentioned, we have to switch back to $\Phi$ from $\Psi$ at the hitting time of a state with three
tokens, since the expectation of $\Phi$ grows “fast enough” after $\tau$, as we will see. The second problem is whether the
mentioned switch of potentials can be carried out without major loss in expectation. If it can, then we can conclude that,
starting from any initial configuration, this mixed potential grows at least as fast as the potential of the process
starting at the equidistant three-token configuration.

To establish a relation between the two potentials we will need the following lemma, which is proved in the Appendix:


Lemma 2.7. For any state $x$ with three tokens the following inequality holds: $\Phi(x) \ge c \cdot \Psi(x)$ for some $c$ where

\[ c \ge \frac{27}{0.9 \cdot 4\pi^2} \approx 0.7599. \]
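Now, the main theorem follows assuming this Lemma.

Proof (of Theorem 2.1). First, let us investigate what happens to the potential at the moment of the potential-interchange:

\[ \Phi(X_\tau) = 1 - \frac{E(T(x))\big|_{x = X_\tau}}{\max_{y,\,K(y)\le 3} E_y(T(y))} = 1 - \frac{27}{4N^2} E(T(X_\tau) \mid X_\tau) = 1 - \frac{27}{4N^2} E(T - \tau \mid X_\tau). \]

Hence, taking expectation gives

\[ E(\Phi(X_\tau)) = 1 - \frac{27}{4N^2} E(T - \tau). \]

So now, we can estimate $ET$ as

\[ ET = E\tau + E(T - \tau) = E\tau + \frac{4N^2}{27} \big( 1 - E(\Phi(X_\tau)) \big) \le \]

where we can apply Lemma 2.7:

\[ \le E\tau + \frac{4N^2}{27} \big( 1 - c\,E(\Psi(X_\tau)) \big) = E\tau + \frac{4N^2}{27} \big( 1 - c\,E(Y_\tau) \big) \le \]

So we can use the estimate of $E(Y_\tau)$ proved in Corollary 2.6:

\[ \le E\tau + \frac{4N^2}{27} \big( 1 - c \cdot (4\varepsilon E(\tau) + Y_0) \big) = \Big( 1 - \frac{16N^2}{27} c\,\varepsilon \Big) E\tau + \frac{4N^2}{27} - \frac{4N^2}{27} c\,Y_0 \]

where we can use the value of $\varepsilon = \sin^2\big(\frac{\pi}{2N}\big) \ge \big(\frac{\pi}{2N}\big)^2 \cdot 0.9$ if $N \ge 3$, since $\sin^2\big(\frac{\pi}{6}\big) \approx 0.9119 \cdot \big(\frac{\pi}{6}\big)^2$. Hence, we can
continue as

\[ \le \Big( 1 - \frac{0.9 \cdot 4\pi^2}{27} c \Big) E\tau + \frac{4N^2}{27} - \frac{4N^2}{27} c\,Y_0 \le \frac{4N^2}{27} \]

by $c \ge \frac{27}{0.9 \cdot 4\pi^2}$. Therefore, we got the inequality part of the statement. To see the case of equality, note that in
the last inequality we estimated $E(\tau)$ from below by zero and $Y_0$ by zero as well. If we did not lose anything here
then $E(\tau) = 0$, hence we start from a three-token state, and also $Y_0 = 0$, hence we started from the equidistant
configuration by Proposition 2.3. □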
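The numerical constants used in the last steps of the proof can be sanity-checked with a few lines of code. This is only an illustrative sketch: the names `C_BOUND`, `sine_ratio` and `drift_coefficient` are hypothetical, and the coefficient is assumed to have the form $1 - \frac{16N^2}{27} c\,\varepsilon$ with $c = \frac{27}{0.9 \cdot 4\pi^2}$ and $\varepsilon = \sin^2(\pi/(2N))$.

```python
import math

# Lower bound on the constant c from Lemma 2.7.
C_BOUND = 27 / (0.9 * 4 * math.pi ** 2)


def sine_ratio(n):
    """sin^2(x) / x^2 at x = pi / (2n); the proof needs this to be at least 0.9."""
    x = math.pi / (2 * n)
    return math.sin(x) ** 2 / x ** 2


def drift_coefficient(n, c=C_BOUND):
    """Coefficient of E(tau) in the final estimate: 1 - (16 n^2 / 27) * c * eps."""
    eps = math.sin(math.pi / (2 * n)) ** 2
    return 1 - 16 * n ** 2 / 27 * c * eps
```

A non-positive `drift_coefficient(n)` for every $N \ge 3$ is what allows the term containing $E(\tau)$ to be dropped in the final inequality.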
A Properties of the potential

In this section, we will establish the technical lemmas on $\Psi$, its expected evolution and its relation to the original
potential $\Phi$.

Proposition A.1. For any state $x$ we have

1. The choice that we made at the identification of the plane with $\mathbb{C}$ does not affect $\Psi(x)$. In other words, $\Psi(x)$
is invariant under the simultaneous translation of the $x(j)$’s, even if during the translation a token jumps over
$1 \in \mathbb{C}$.

2. The disappearance of two tokens when they meet does not affect $\Psi(x)$.

3. $\Psi(x) \le 1$, with equality if and only if there is only one token at state $x$.

4. $\Psi(x) \ge 0$, with equality if and only if $x$ is the equidistant (not necessarily three-token) configuration.

Remark A.2. Thanks to the first statement, we may always assume that in a fixed step the node 1 is not involved
and the tokens are enumerated in increasing order starting at the node 1, hence the computations in the previous
section are completely valid.

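Proof. For the first statement, note that a $\frac{2\pi k}{2N}$ rotation of the choice of $1 \in \mathbb{C}$ corresponds to a multiplication of
every vector by a complex number of length 1. Assuming that $x(j) \le 2N$ implies $x(j) + k \le 2N$, i.e. no token jumps
over the edge $(2N, 1)$, it will not change $\Psi(x)$ since

\[ \Big| \sum_{l=1}^{K(x)} e^{\frac{i\pi}{2N} x^*(l)} (-1)^l \Big|^2 = \Big| \sum_{l=1}^{K(x)} e^{\frac{i\pi}{2N} (x(l)+k)} (-1)^l \Big|^2 = \Big| \sum_{l=1}^{K(x)} e^{\frac{i\pi}{2N} x(l)} (-1)^l \Big|^2 \]

where $x^*$ stands for the renumbered sequence. So it is enough to consider the case when the last token sits at $x(K(x)) = 2N$
and we rotate the nodes and vectors by the angle $\frac{2\pi}{2N}$, i.e. by one node, so that this token arrives at $1 = 2N + 1$. That
rotation changes two things: the way we have to enumerate the tokens and also the way we count the position of this token. Indeed,
the numbering is shifted by one and the position of this token is now 1, not $2N + 1$; that is, $x^*(1) = 1$ and $x^*(l) = x(l-1) + 1$ for $l \ge 2$. Therefore, writing $K = K(x)$ and using that $K$ is odd and $e^{\frac{i\pi}{2N} x(K)} = -1$,

\[ \sum_{l=1}^{K} e^{\frac{i\pi}{2N} x^*(l)} (-1)^l = -e^{\frac{i\pi}{2N}} + \sum_{l=2}^{K} e^{\frac{i\pi}{2N} (x(l-1)+1)} (-1)^l = -e^{\frac{i\pi}{2N}} \Big( 1 + \sum_{l=1}^{K-1} e^{\frac{i\pi}{2N} x(l)} (-1)^l \Big) \]

\[ = -e^{\frac{i\pi}{2N}} \Big( (-1)^{K} e^{\frac{i\pi}{2N} x(K)} + \sum_{l=1}^{K-1} e^{\frac{i\pi}{2N} x(l)} (-1)^l \Big) = -e^{\frac{i\pi}{2N}} \sum_{l=1}^{K} e^{\frac{i\pi}{2N} x(l)} (-1)^l, \]

and taking the squared absolute value of both sides verifies the first statement.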

The second statement is clear, since if $x(j) = x(j+1)$ then they represent the same vector contributing to the
sum with different signs, so their sum is zero:

\[ e^{\frac{i\pi}{2N} x(j)} (-1)^j + e^{\frac{i\pi}{2N} x(j+1)} (-1)^{j+1} = 0 \]

and the change of the enumeration (every index after $j+1$ decreases by 2) will not change anything because
$(-1)^l = (-1)^{l-2}$.



To prove the third statement, we may assume, by the first part, that the last token is at $x(K(x)) = 2N$, so the
corresponding unit vector is the base direction $1 \in \mathbb{C}$. Then we can pair the remaining tokens so that the
sum of the corresponding vectors can be expressed as

\[ \sum_{l=1}^{K(x)} e^{\frac{i\pi}{2N} x(l)} (-1)^l = 1 + \sum_{l=1}^{K(x)-1} e^{\frac{i\pi}{2N} x(l)} (-1)^l = 1 + \sum_{k=1}^{(K(x)-1)/2} \Big( e^{\frac{i\pi}{2N} x(2k)} - e^{\frac{i\pi}{2N} x(2k-1)} \Big) \]

where each $e^{\frac{i\pi}{2N} x(2k)} - e^{\frac{i\pi}{2N} x(2k-1)}$ can be pictured as the vector starting at the endpoint of the angle
bisector unit vector $e^{\frac{i\pi}{2N} x(2k-1)}$ and ending at the endpoint of $e^{\frac{i\pi}{2N} x(2k)}$, so it is a chord of the unit circle. By that
picture, it is clear that the sum of several directed chords on a half-circle plus the vector $(1,0)$ will not sum up to a
vector longer than 1. Indeed, the sum of their projections to any line in the plane has length at most 1, which is
equivalent to having length at most 1. The case of equality means that there is a projection such that it has length
1 in one direction, but since the chords must not meet (since $x(j) \ne x(j+1)$ for all $j$) and also no angle bisector
can end at $-1$, it is possible only if that projection is on a horizontal line, so there is no token besides the one on
the node $2N$.

The inequality in the fourth statement is obvious by $|\cdot|^2 \ge 0$, and the case of equality can be investigated by the
same geometric argument as in the third statement. Namely, it is the zero vector if and only if all of its projections
are zero. □


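A.1 Growth of $\Psi$


Lemma A.3. For all $t \in \mathbb{N}$ the following holds:

\[ E(Y_{t+1} \mid \mathcal{F}_t) = (1-\varepsilon) \cdot Y_t + \varepsilon \cdot K_t \]

where $\mathcal{F}_t := \sigma(X_s(j) \mid s \le t,\ j \le K_0)$ is the standard filtration of the process, $\varepsilon := \sin^2\big(\frac{\pi}{2N}\big)$, and $K_t = K_t(x)$ is
the number of tokens at time $t$.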

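Proof. First, we investigate what happens to the corresponding angle bisector of one token at a step. Let
us use the notation $d_t(j) := X_{t+1}(j) - X_t(j) \in \{-1, 1\}$ for the step of $X(j)$ at the $t$-th moment. Then we can
decompose the corresponding unit vector as follows:

\[ e^{\frac{i\pi}{2N} X_{t+1}(j)} = e^{\frac{i\pi}{2N} (X_t(j) + d_t(j))} = e^{\frac{i\pi}{2N} X_t(j)} \cos\Big(\frac{\pi}{2N} d_t(j)\Big) + i\, e^{\frac{i\pi}{2N} X_t(j)} \sin\Big(\frac{\pi}{2N} d_t(j)\Big) =: C_t(j) + S_t(j). \]

Geometrically, a step of the vector can be described as shortening the original vector and adding a small orthogonal
vector to it.

Note that the resulting terms get simplified under conditional expectation. On one hand, $E(S_t(j)) = 0$ since
$S_t(j)$ and $-S_t(j)$ have the same distribution. Moreover,

\[ E(S_t(j) \mid \mathcal{F}_t) = i\, e^{\frac{i\pi}{2N} X_t(j)} E\Big( \sin\Big(\frac{\pi}{2N} d_t(j)\Big) \,\Big|\, \mathcal{F}_t \Big) = 0 \]

by the independence of $\mathcal{F}_t$ and $d_t(j)$. On the other hand, $C_t(j)$ does not depend on $d_t(j)$ since $d_t(j)$ has values in
$\{1, -1\}$, so

\[ C_t(j) = e^{\frac{i\pi}{2N} X_t(j)} \cos\Big(\frac{\pi}{2N}\Big), \quad \text{i.e.} \quad E(C_t(j) \mid \mathcal{F}_t) = C_t(j). \]

Therefore, we can compute the growth of $Y_t$ after a step. To expand $|\cdot|^2$ we use the notation $\langle v, w \rangle = \mathrm{Re}(v \bar{w})$
for the ordinary Euclidean scalar product on the plane. Hence, we can write:

\[ E(Y_{t+1} \mid \mathcal{F}_t) = E\Big( \Big\langle \sum_{a=1}^{K_t} (-1)^a (C_t(a) + S_t(a)),\ \sum_{b=1}^{K_t} (-1)^b (C_t(b) + S_t(b)) \Big\rangle \,\Big|\, \mathcal{F}_t \Big) \]

\[ = \sum_{a=1}^{K_t} \sum_{b=1}^{K_t} E\big( (-1)^{a+b} \langle C_t(a) + S_t(a),\ C_t(b) + S_t(b) \rangle \mid \mathcal{F}_t \big) \]

where most of the terms of this sum cancel out since

\[ E(\langle C_t(a), S_t(b) \rangle \mid \mathcal{F}_t) = \big\langle e^{\frac{i\pi}{2N} X_t(a)},\ i\, e^{\frac{i\pi}{2N} X_t(b)} \big\rangle \cdot E\Big( \cos\Big(\frac{\pi}{2N} d_t(a)\Big) \sin\Big(\frac{\pi}{2N} d_t(b)\Big) \,\Big|\, \mathcal{F}_t \Big) = 0 \]

\[ E(\langle S_t(a), C_t(b) \rangle \mid \mathcal{F}_t) = \big\langle i\, e^{\frac{i\pi}{2N} X_t(a)},\ e^{\frac{i\pi}{2N} X_t(b)} \big\rangle \cdot E\Big( \cos\Big(\frac{\pi}{2N} d_t(b)\Big) \sin\Big(\frac{\pi}{2N} d_t(a)\Big) \,\Big|\, \mathcal{F}_t \Big) = 0 \]

by the independence of the $d_t(j)$’s and the antisymmetry of $S_t(j)$. The other terms get the form

\[ E(\langle C_t(a), C_t(b) \rangle \mid \mathcal{F}_t) = \cos^2\Big(\frac{\pi}{2N}\Big) \big\langle e^{\frac{i\pi}{2N} X_t(a)},\ e^{\frac{i\pi}{2N} X_t(b)} \big\rangle \]

\[ E(\langle S_t(a), S_t(b) \rangle \mid \mathcal{F}_t) = \delta_{a,b} \cdot \sin^2\Big(\frac{\pi}{2N}\Big) \]

by again the independence of the $d_t(j)$’s and $E(S_t(j) \mid \mathcal{F}_t) = 0$. Therefore, we get

\[ E(Y_{t+1} \mid \mathcal{F}_t) = \sum_{a=1}^{K_t} \sum_{b=1}^{K_t} (-1)^{a+b} \cos^2\Big(\frac{\pi}{2N}\Big) \big\langle e^{\frac{i\pi}{2N} X_t(a)},\ e^{\frac{i\pi}{2N} X_t(b)} \big\rangle + \sum_{a=1}^{K_t} \sin^2\Big(\frac{\pi}{2N}\Big) \]

\[ = \cos^2\Big(\frac{\pi}{2N}\Big) \cdot Y_t + K_t \sin^2\Big(\frac{\pi}{2N}\Big) = (1-\varepsilon) \cdot Y_t + \varepsilon \cdot K_t \]

as we stated. □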

A.2 Estimation for the change of potential 

The values of the two potentials in the three-token case can be expressed explicitly by real analytic functions, hence to
establish an inequality on their relation we use elementary analysis.

Lemma A.4. For any state $x$ with three tokens the following inequality holds: $\Phi(x) \ge c \cdot \Psi(x)$ for some $c$ where

\[ c \ge \frac{27}{0.9 \cdot 4\pi^2} \approx 0.7599. \]


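Proof. Let us denote the “distances” of the tokens of state $x$ (i.e. the number of original nodes on the arc connecting
two tokens avoiding the third) by $a$, $b$ and $c$ as in [KMOWW12], so $a, b, c \in \mathbb{N}$ (not $2\mathbb{N}$) and $a + b + c = N$.
Then $\Phi(x) = 1 - \frac{27abc}{N^3}$ as we saw after the statement of Theorem 2.1. Besides,

\[ \Psi(x) = \Big| -e^{\frac{i\pi}{2N} x(1)} + e^{\frac{i\pi}{2N} x(2)} - e^{\frac{i\pi}{2N} x(3)} \Big|^2 = \]

\[ = 3 - 2\,\mathrm{Re}\Big( e^{\frac{i\pi}{2N} (x(1) - x(2))} \Big) - 2\,\mathrm{Re}\Big( e^{\frac{i\pi}{2N} (x(2) - x(3))} \Big) + 2\,\mathrm{Re}\Big( e^{\frac{i\pi}{2N} (x(1) - x(3))} \Big) = \]

\[ = 3 - 2\cos\Big( \frac{\pi}{2N} (x(1) - x(2)) \Big) - 2\cos\Big( \frac{\pi}{2N} (x(2) - x(3)) \Big) + 2\cos\Big( \frac{\pi}{2N} (x(1) - x(3)) \Big) = \]

\[ = 3 - 2\cos\Big( \frac{\pi a}{N} \Big) - 2\cos\Big( \frac{\pi b}{N} \Big) + 2\cos\Big( \frac{\pi (N - c)}{N} \Big). \]

Notice that both expressions vanish at $a = b = \frac{N}{3}$, so we re-parametrize them: let $u = \frac{a}{N} - \frac{1}{3} \in \big[-\frac{1}{3}, \frac{2}{3}\big]$ and
similarly $v = \frac{b}{N} - \frac{1}{3} \in \big[-\frac{1}{3}, \frac{2}{3}\big]$. By this, the potential $\Phi$ gets the form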

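\[ \Phi(x) = 1 - 27 \Big( \frac{1}{3} + u \Big) \Big( \frac{1}{3} + v \Big) \Big( \frac{1}{3} - u - v \Big) = \]

\[ = 1 - \big( (1 - 3u - 3v) + (3v - 9uv - 9v^2) + (3u - 9u^2 - 9uv) + (9uv - 27u^2v - 27uv^2) \big) = \]

\[ = 9uv + 9u^2 + 9v^2 + 27u^2v + 27uv^2 = \frac{9}{2} \big( u^2 + v^2 + (u+v)^2 \big) + 27 (u^2 v + u v^2), \]

while the second potential is

\[ \Psi(x) = 3 - 2\cos\Big( \pi \Big( u + \frac{1}{3} \Big) \Big) - 2\cos\Big( \pi \Big( v + \frac{1}{3} \Big) \Big) + 2\cos\Big( \pi \Big( u + v + \frac{2}{3} \Big) \Big) = \]

\[ = 3 - 2 \Big( \frac{1}{2} \cos(\pi u) - \frac{\sqrt{3}}{2} \sin(\pi u) \Big) - 2 \Big( \frac{1}{2} \cos(\pi v) - \frac{\sqrt{3}}{2} \sin(\pi v) \Big) + 2 \Big( -\frac{1}{2} \cos(\pi (u+v)) - \frac{\sqrt{3}}{2} \sin(\pi (u+v)) \Big) = \]

\[ = 3 - \cos(\pi u) - \cos(\pi v) - \cos(\pi (u+v)) + \sqrt{3} \big( \sin(\pi u) + \sin(\pi v) - \sin(\pi (u+v)) \big) = \]

\[ = 3 - \cos(\pi u) - \cos(\pi v) - \cos(\pi (u+v)) + \sqrt{3} \big( \sin(\pi u)(1 - \cos(\pi v)) + \sin(\pi v)(1 - \cos(\pi u)) \big). \]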


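Therefore, the statement is equivalent to

\[ Q(u,v) := \frac{\frac{9}{2} \big( u^2 + v^2 + (u+v)^2 \big) + 27 (u^2 v + u v^2)}{3 - \cos(\pi u) - \cos(\pi v) - \cos(\pi (u+v)) + \sqrt{3} \big( \sin(\pi u)(1 - \cos(\pi v)) + \sin(\pi v)(1 - \cos(\pi u)) \big)} \ge c \]

for all $(u,v) \in \big[-\frac{1}{3}, \frac{2}{3}\big]^2$ assuming $u + v \le \frac{1}{3}$.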


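For a more compact form of $Q$ we will briefly write $f(x) := \frac{1 - \cos(\pi x)}{x^2}$ and $g(x) := \frac{\sin(\pi x)}{x}$, where $f$ and $g$ are even
functions with $f(x) \in \big[\frac{27}{8}, \frac{\pi^2}{2}\big]$ and $g(x) \in \big[\frac{3\sqrt{3}}{4}, \pi\big]$ if $x \in \big[-\frac{1}{3}, \frac{2}{3}\big]$. With this notation,

\[ Q(u,v) = \frac{\frac{9}{2} \big( u^2 + v^2 + (u+v)^2 \big) + 27 (u^2 v + u v^2)}{f(u) u^2 + f(v) v^2 + f(u+v)(u+v)^2 + \sqrt{3} \big[ g(u) f(v) u v^2 + g(v) f(u) u^2 v \big]}. \]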




Figure 2: The function $\big[-\frac{1}{3}, \frac{2}{3}\big]^2 \ni (u,v) \mapsto Q(u,v)$, for which we should show that it is at least 0.7599.


To verify the statement, first we compute the limit of $Q$ around the origin, but in a constructive way. Namely,
we have to give an explicit $\varepsilon$-$\delta$ so that we can omit the $\delta$-neighborhood of the origin in the further computation. In
fact, the origin is the only place where both the numerator and the denominator vanish in the region
$\big\{ (u,v) \in \big[-\frac{1}{3}, \frac{2}{3}\big]^2 \mid u + v \le \frac{1}{3} \big\}$, hence on the remaining part of the domain (outside of a neighborhood of the origin), it
is enough to give an upper bound on the length of the derivative of $Q$. Knowing that bound, it will be enough to
compute finitely many values (which are strictly bigger than $c$) of the fraction by some mathematical software: the
estimate of the gradient will prevent the function from going below $c$ on the whole domain.
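The "finitely many values" step can be illustrated by evaluating the ratio of the two potentials on a grid of three-token states. This is a sketch only, and a grid evaluation by itself is not a proof without the gradient bound. It assumes the three-token forms $\Phi = 1 - 27abc/N^3$ and $\Psi = 3 - 2\cos(\pi a/N) - 2\cos(\pi b/N) - 2\cos(\pi c/N)$ (the last term rewritten via $\cos(\pi(N-c)/N) = -\cos(\pi c/N)$); the names `phi3`, `psi3` and `min_ratio` are hypothetical.

```python
import math


def phi3(a, b, c):
    """Rescaled expected-time potential of a three-token state with gaps a, b, c."""
    n = a + b + c
    return 1 - 27 * a * b * c / n ** 3


def psi3(a, b, c):
    """Trigonometric potential of the same state (gaps counted in original nodes)."""
    n = a + b + c
    return 3 - 2 * sum(math.cos(math.pi * d / n) for d in (a, b, c))


def min_ratio(n):
    """Smallest value of phi3 / psi3 over all gap triples summing to n."""
    best = math.inf
    for a in range(n + 1):
        for b in range(n + 1 - a):
            c = n - a - b
            if 3 * a == n and 3 * b == n:
                continue  # equidistant state: both potentials vanish (0/0)
            best = min(best, phi3(a, b, c) / psi3(a, b, c))
    return best
```

If the lemma holds, `min_ratio(n)` should stay above 0.7599 for every `n`; a finer grid corresponds to a larger `n`.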

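So first, we compute the limit of the fraction along the lines $v = \lambda u$ for some positive $\lambda$. We may assume that
$\lambda \le 1$ since the function is symmetric. Let $0 < u \le \delta$ where $\delta$ is specified later and substitute into $Q(u,v)$ where it
is needed:

\[ Q(u, \lambda u) = \frac{\frac{9}{2} \big( 1 + \lambda^2 + (1+\lambda)^2 \big) u^2 + 27 (\lambda + \lambda^2) u^3}{f(u) u^2 + f(v) \lambda^2 u^2 + f(u+v)(1+\lambda)^2 u^2 + \sqrt{3} \big( g(u) f(v) \lambda^2 u^3 + g(v) f(u) \lambda u^3 \big)}. \]

We can simplify by $u^2$ and estimate the functions $f$ and $g$ on the region $|x| \le \delta$ (since $|v| \le |u|$):

\[ Q(u, \lambda u) \ge \frac{9 (1 + \lambda + \lambda^2) + 27 (\lambda + \lambda^2) \cdot 0}{\frac{\pi^2}{2} + \frac{\pi^2}{2} \lambda^2 + \frac{\pi^2}{2} (1+\lambda)^2 + \sqrt{3} \big( \frac{\pi^3}{2} \lambda^2 + \frac{\pi^3}{2} \lambda \big) \delta} \ge \frac{9 (1 + \lambda + \lambda^2)}{\pi^2 (1 + \lambda + \lambda^2) + \sqrt{3} \pi^3 \delta} \ge \frac{9}{\pi^2 + \sqrt{3} \pi^3 \delta}. \]

So it is enough to choose a $\delta$ such that $\frac{9}{\pi^2 + \sqrt{3} \pi^3 \delta} \ge \frac{27}{0.9 \cdot 4\pi^2} = c$; such a $\delta$ clearly exists since $\frac{9}{\pi^2} > \frac{27}{0.9 \cdot 4\pi^2}$. For example,
$\delta = 0.03$ works. If $u \in [-\delta, 0]$ then an analogous computation works: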


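\[ Q(u, \lambda u) \ge \frac{9 (1 + \lambda + \lambda^2) + 27 (\lambda + \lambda^2) \cdot u}{\pi^2 (1 + \lambda + \lambda^2) + \sqrt{3} \big( g(u) f(v) \lambda^2 + g(v) f(u) \lambda \big) \cdot u} \ge \frac{9 (1 + \lambda + \lambda^2) + 27 (\lambda + \lambda^2) \cdot (-\delta)}{\pi^2 (1 + \lambda + \lambda^2) + \sqrt{3} \cdot 0} = \frac{9}{\pi^2} - \frac{27 (\lambda + \lambda^2) \delta}{\pi^2 (1 + \lambda + \lambda^2)} \]

which is clearly at least $\frac{27}{0.9 \cdot 4\pi^2} = c$ if $(9 \cdot 0.9 \cdot 4 - 27)(1 + \lambda + \lambda^2) \ge 27 \cdot 0.9 \cdot 4 (\lambda + \lambda^2) \delta$ for all $\lambda \in [0,1]$.
Equivalently, we need $\delta \le \frac{1}{12}$, which is clearly satisfied by $\delta = 0.03$. If $\lambda$ is negative and $u \in [0, \delta]$ then we can still
assume that $\lambda \ge -1$ by symmetry. Moreover, the first estimates remain valid: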


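\[ Q(u, \lambda u) \ge \frac{9 (1 + \lambda + \lambda^2) + 27 (\lambda + \lambda^2) \cdot u}{\pi^2 (1 + \lambda + \lambda^2) + \sqrt{3} \big( g(u) f(v) \lambda^2 + g(v) f(u) \lambda \big) \cdot u} \ge \frac{9 (1 + \lambda + \lambda^2) + 27 (\lambda + \lambda^2) \cdot 0}{\pi^2 (1 + \lambda + \lambda^2) + \sqrt{3} \big( \frac{\pi^3}{2} \lambda^2 + 0 \cdot \lambda \big) \cdot u} \ge \frac{9}{\pi^2 + \frac{\sqrt{3}}{2} \pi^3 \delta} \]

which is at least $\frac{27}{0.9 \cdot 4\pi^2}$ if $\delta \le \frac{9 \cdot 0.9 \cdot 4 - 27}{27 \cdot \frac{\sqrt{3}}{2} \pi} \approx 0.0735$, so $\delta = 0.03$ still works.

Similarly, if $u \in [-\delta, 0]$ and $\lambda \in [-1, 0]$ then

\[ Q(u, \lambda u) \ge \frac{9 (1 + \lambda + \lambda^2) + 27 (\lambda + \lambda^2) \cdot u}{\pi^2 (1 + \lambda + \lambda^2) + \sqrt{3} \big[ g(u) f(v) \lambda^2 + g(v) f(u) \lambda \big] \cdot u} \]

where one can observe that $(\lambda + \lambda^2) \cdot u \ge 0$ and $\lambda^2 \cdot u \le 0$, hence we get

\[ \ge \frac{9 (1 + \lambda + \lambda^2) + 27 \cdot 0}{\pi^2 (1 + \lambda + \lambda^2) + \sqrt{3} \big( 0 + \frac{\pi^3}{2} \cdot \lambda \big) \cdot u} \ge \frac{9}{\pi^2 + \frac{\sqrt{3}\, \pi^3 \delta}{2 (1 - \frac{1}{4})}} \]

which is at least $\frac{27}{0.9 \cdot 4\pi^2}$ if $\delta \le \frac{9 \cdot 0.9 \cdot 4 - 27}{27 \cdot \frac{2}{\sqrt{3}} \pi} \approx 0.0551$, so the final $\delta$ is 0.03.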
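The thresholds on $\delta$ quoted in the case analysis can be recomputed directly. The closed forms below are reconstructions from the quoted numerical values 0.0735 and 0.0551 and from the target constant $c = \frac{27}{0.9 \cdot 4\pi^2}$, so they should be read as assumptions; the names `C`, `GAP`, `THRESHOLDS` and `lower_bounds` are hypothetical.

```python
import math

C = 27 / (0.9 * 4 * math.pi ** 2)   # target constant c
GAP = 9 * 0.9 * 4 - 27              # numerator shared by the delta thresholds

# Largest admissible delta in each sign case (u, lambda), as reconstructed.
THRESHOLDS = {
    "u<0, lam>0": 1 / 12,
    "u>0, lam<0": GAP / (27 * math.sqrt(3) / 2 * math.pi),
    "u<0, lam<0": GAP / (27 * 2 / math.sqrt(3) * math.pi),
}


def lower_bounds(delta):
    """Lower bounds on Q(u, lam*u) near the origin for the remaining cases."""
    pi = math.pi
    return {
        "u>0, lam>0": 9 / (pi ** 2 + math.sqrt(3) * pi ** 3 * delta),
        "u>0, lam<0": 9 / (pi ** 2 + math.sqrt(3) / 2 * pi ** 3 * delta),
        "u<0, lam<0": 9 / (pi ** 2 + 2 / math.sqrt(3) * pi ** 3 * delta),
    }
```

Evaluating `lower_bounds(0.03)` and comparing each entry with `C`, together with checking `0.03` against every entry of `THRESHOLDS`, reproduces the claim that $\delta = 0.03$ is admissible in all cases under these assumed forms.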

Let us denote by $N$ and $D$ the numerator and denominator of $Q$ respectively. To give a bound on the derivative
in the remaining region (outside of $[-\delta, \delta]^2$), one has to estimate the following:

\[ |\nabla Q|^2 = \Big( \frac{\partial_u N \cdot D - N \cdot \partial_u D}{D^2} \Big)^2 + \Big( \frac{\partial_v N \cdot D - N \cdot \partial_v D}{D^2} \Big)^2 \]

which is possible term-wise, namely one can find a constant $K$ such that $|\partial N \cdot D| + |N \cdot \partial D| \le K \cdot D^2$ for both
partial derivatives, since $\partial_u D$, $\partial_v D$ and $D$ can be estimated using the trivial bounds on $f$, $g$, $f'$ and $g'$. The only

occurring problem is at ^ since D has a zero here but =lsowe can omit a neighborhood 

of this point. Unfortunately, the estimates depend on the signs of $u$ and $v$, so a case by case
analysis similar to the previous one is needed, but in a much easier context: the choice of $K$ is limited only by practicality, meaning that the lower $K$ is, the
fewer concrete values we need to determine. This last task can be handled by any mathematical software package,
and was essentially done in the background when we plotted the graph of the function. □